1. Introduction:

Let $(X, \mathcal{A})$ and $(Y, \mathcal{B})$ be measurable spaces and let $T : X \to Y$ be surjective and measurable. Let $\mathcal{M}$ be a set of finite positive measures on $(X, \mathcal{A})$. To each $\mu \in \mathcal{M}$ there corresponds a measure $\mu T^{-1}$ on $(Y, \mathcal{B})$, defined for $F \in \mathcal{B}$ by

$$\mu T^{-1}(F) = \mu(T^{-1}(F)).$$


If $f$ is a $\mu$-integrable real valued function on $X$, then as a consequence of the Radon-Nikodym Theorem, there is a $\mu T^{-1}$-integrable function $e_\mu(f)$ on $Y$ satisfying

$$\int_F e_\mu(f)\, d\mu T^{-1} = \int_{T^{-1}(F)} f\, d\mu$$

for each $F \in \mathcal{B}$. Clearly $e_\mu(f)$ is defined only up to sets in $Y$ of $\mu T^{-1}$ measure 0, and $f = g$ a.e. $(\mu)$ implies $e_\mu(f) = e_\mu(g)$ a.e. $(\mu T^{-1})$. The linear operator $e_\mu$ defined as above maps the space $L^1(X, \mathcal{A}, \mu)$ to the space $L^1(Y, \mathcal{B}, \mu T^{-1})$ and is called the conditional expectation operator. Its value $e_\mu(f)$ at $f \in L^1(X, \mathcal{A}, \mu)$ is called the conditional expectation of $f$ given $T$.

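The conditional probability of an event $E \in \mathcal{A}$ is defined as

$$P_\mu(E) = e_\mu(\chi_E),$$

where $\chi_E$ is the indicator function of $E$. The conditional probability functions satisfy

(a) $P_\mu(E) \in \mathcal{J}(Y, \mathcal{B}, \mu T^{-1})$, where $\mathcal{J}(Y, \mathcal{B}, \mu T^{-1})$ is the set of all real valued $\mathcal{B}$-measurable functions on $Y$, with equality defined as equality a.e. $(\mu T^{-1})$.

(b) For each $F \in \mathcal{B}$, $E \in \mathcal{A}$,

$$\mu(E \cap T^{-1}(F)) = \int_F P_\mu(E)\, d\mu T^{-1}.$$

(c) $0 \le P_\mu(E) \le 1$ for each $E \in \mathcal{A}$ and $P_\mu(X) = 1$.

(d) If $\{E_n\}_{n=1}^{\infty}$ is a disjoint sequence of events in $\mathcal{A}$, then

$$P_\mu\Big(\bigcup_{n=1}^{\infty} E_n\Big) = \sum_{n=1}^{\infty} P_\mu(E_n) \quad \text{a.e. } (\mu T^{-1}).$$

It should be noted that $P_\mu$ satisfies property (c) even when $\mu$ is not a probability measure.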
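Properties (b) and (c) can be checked by hand on a finite space, where $P_\mu(E)$ reduces to a ratio of point masses. The following is a minimal sketch under that assumption; the space, the map `T`, the measure `mu`, and all function names are hypothetical choices made for illustration and do not come from the paper.

```python
from fractions import Fraction

# Hypothetical finite example: X = {0,...,5}, Y = {0, 1, 2}, T(x) = x // 2.
X = range(6)
Y = range(3)

def T(x):
    return x // 2

# A finite positive measure on X given by point masses (total mass 3, not 1).
mu = {0: Fraction(1, 2), 1: Fraction(1, 2), 2: Fraction(1, 4),
      3: Fraction(3, 4), 4: Fraction(1, 3), 5: Fraction(2, 3)}

def measure(m, E):
    """m(E) for a set E of points of X."""
    return sum(m[x] for x in E)

def preimage(F):
    """T^{-1}(F) for a set F of points of Y."""
    return {x for x in X if T(x) in F}

def pushforward(m):
    """Point masses of the induced measure m T^{-1} on Y."""
    return {y: measure(m, preimage({y})) for y in Y}

def cond_prob(m, E):
    """P_m(E) as a function on Y: y -> m(E ∩ T^{-1}{y}) / m T^{-1}{y}.

    Assumes every point of Y has positive induced mass.
    """
    mT = pushforward(m)
    return {y: measure(m, set(E) & preimage({y})) / mT[y] for y in Y}

E = {0, 2, 3, 5}
p = cond_prob(mu, E)
mT = pushforward(mu)

# Property (b) with F = {0, 2}: both sides are computed independently.
F = {0, 2}
lhs = measure(mu, E & preimage(F))
rhs = sum(p[y] * mT[y] for y in F)
```

Here `lhs` and `rhs` agree, and `cond_prob(mu, set(X))` is identically 1 although `mu` has total mass 3, which is the remark made about property (c).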

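The transformation $T$ is called a sufficient statistic for $\mathcal{M}$ if for each $E \in \mathcal{A}$ there is a $\mathcal{B}$-measurable function $P(E)$ on $Y$ such that

$$P(E) = P_\mu(E) \quad \text{a.e. } (\mu T^{-1})$$

for each $\mu \in \mathcal{M}$. The set $\mathcal{M}$ is dominated by a measure $\lambda$ (perhaps not in $\mathcal{M}$) if for each $\mu \in \mathcal{M}$, $\mu$ is absolutely continuous with respect to $\lambda$ (written $\mu \ll \lambda$). $\mathcal{M}$ is homogeneous if it is dominated by each of its members. A measure $\lambda$ is equivalent to $\mathcal{M}$ if $\lambda$ dominates $\mathcal{M}$ and $\mu(E) = 0$ for each $\mu \in \mathcal{M}$ implies $\lambda(E) = 0$.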

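The notation and terminology used in this paper are taken from (Halmos and Savage; 1949), as are the following three theorems. The notation

$$\frac{d\mu}{d\lambda} \;(\varepsilon)\; T^{-1}(\mathcal{B})$$

means that there is an element of the equivalence class $\frac{d\mu}{d\lambda}$ of Radon-Nikodym derivatives which is $T^{-1}(\mathcal{B})$ measurable.

Theorem 1: If $\mathcal{M}$ is dominated, then a statistic $T$ is sufficient for $\mathcal{M}$ if and only if there exists a measure $\lambda$ equivalent to $\mathcal{M}$ such that for each $\mu \in \mathcal{M}$, $\frac{d\mu}{d\lambda} \;(\varepsilon)\; T^{-1}(\mathcal{B})$.

Theorem 2: If $\mathcal{M}$ is dominated, then a statistic $T$ is sufficient for $\mathcal{M}$ if and only if $T$ is sufficient for each pair $\{\mu, \nu\}$ of elements of $\mathcal{M}$.

Theorem 3: If $\mathcal{M}$ is homogeneous, then a statistic $T$ is sufficient for $\mathcal{M}$ if and only if for each $\mu, \nu \in \mathcal{M}$, $\frac{d\mu}{d\nu} \;(\varepsilon)\; T^{-1}(\mathcal{B})$.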

2. Homogeneous Families:

Henceforth, we will assume that $\mathcal{M}$ is homogeneous. Let $C(\mathcal{M})$ denote the cone generated by $\mathcal{M}$, excluding the zero measure. That is, $C(\mathcal{M})$ is the set of all finite linear combinations, with strictly positive coefficients, of elements of $\mathcal{M}$. Elements of $C(\mathcal{M})$ are termed mixtures of elements of $\mathcal{M}$. Clearly, $C(\mathcal{M})$ is also homogeneous; hence, the spaces $\mathcal{J}(Y, \mathcal{B}, \mu T^{-1})$ are all the same for $\mu \in C(\mathcal{M})$ and may be denoted simply by $\mathcal{J}$. For $\mu \in C(\mathcal{M})$, $P_\mu$ maps $\mathcal{A}$ to $\mathcal{J}$, and it is clear from the definition of a sufficient statistic that $T$ is sufficient for a subset $\mathcal{N}$ of $C(\mathcal{M})$ if and only if the conditional probability functions $P_\mu$, for $\mu \in \mathcal{N}$, are all equal.

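Lemma 4: If $\mathcal{M}$ is dominated, $\mathcal{N} \subset C(\mathcal{M})$, and $T$ is sufficient for $\mathcal{M}$, then $T$ is sufficient for $\mathcal{N}$.

Proof: Let $\lambda$ be that measure equivalent to $\mathcal{M}$ whose existence is assured by Theorem 1. If $\mu \in C(\mathcal{M})$, then $\mu$ can be written

$$\mu = \sum_{i=1}^{k} \beta_i \nu_i$$

with $\beta_i > 0$, $\nu_i \in \mathcal{M}$ for $i = 1, \ldots, k$. Hence,

$$\frac{d\mu}{d\lambda} = \sum_{i=1}^{k} \beta_i \frac{d\nu_i}{d\lambda} \;(\varepsilon)\; T^{-1}(\mathcal{B}).$$

Since $\lambda$ is also equivalent to $C(\mathcal{M})$, $T$ is sufficient for $C(\mathcal{M})$ and hence is sufficient for $\mathcal{N}$.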

In order to characterize sufficient statistics for $C(\mathcal{M})$, it suffices, by Theorem 2, to consider a pair

$$\mu_I = \sum_{i \in I} \beta_i \mu_i \quad \text{and} \quad \mu_J = \sum_{j \in J} \beta_j \mu_j$$

in $C(\mathcal{M})$, where $I$ and $J$ are finite sets; $\beta_k > 0$ for $k \in I \cup J$; and the measures $\{\mu_i\}_{i \in I}$ are distinct members of $\mathcal{M}$, as are the measures $\{\mu_j\}_{j \in J}$.

The set $C(\mathcal{M})$ of all finite mixtures of elements of $\mathcal{M}$ is said to be identifiable (Teicher, 1960, 1961; Yakowitz, 1969) if each element of $C(\mathcal{M})$ can be expressed in only one way as a linear combination with positive coefficients of elements of $\mathcal{M}$, except for the order of the summands. Equivalently, $C(\mathcal{M})$ is identifiable if the set $\mathcal{M}$ is linearly independent over the real numbers.
The concept of identifiability is very important in establishing the uniqueness and consistency of various estimators of the so called mixing parameters $\{\beta_i : i \in I\}$ in a mixture $\mu_I$ (Yakowitz, 1969).
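On a finite sample space each measure is a vector of point masses, so the linear independence criterion for identifiability becomes a rank computation. The sketch below assumes such a finite setting; the two families and the helper names `rank` and `linearly_independent` are hypothetical and serve only to illustrate the criterion.

```python
from fractions import Fraction

def rank(rows):
    """Rank of a matrix (list of rows) by exact Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    r = 0
    ncols = len(m[0]) if m else 0
    for c in range(ncols):
        # Find a row at or below position r with a nonzero entry in column c.
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            factor = m[i][c] / m[r][c]
            m[i] = [a - factor * b for a, b in zip(m[i], m[r])]
        r += 1
    return r

def linearly_independent(measures):
    """True when the given vectors of point masses are linearly independent."""
    return rank(measures) == len(measures)

# Hypothetical homogeneous families on a three-point space (all masses positive).
independent_family = [[1, 2, 3], [3, 2, 1], [1, 1, 4]]
dependent_family = [[1, 2, 3], [3, 2, 1], [2, 2, 2]]

# In the dependent family the third measure is itself a mixture of the first
# two, so it has two distinct representations and identifiability fails.
half = Fraction(1, 2)
mixture = [half * a + half * b
           for a, b in zip(dependent_family[0], dependent_family[1])]
```

In the second family `mixture` coincides with the third measure, which exhibits the non-unique representation directly.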

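Given a mixture $\mu_I$ in $C(\mathcal{M})$ we have for each $E \in \mathcal{A}$, $F \in \mathcal{B}$,

$$\mu_I(E \cap T^{-1}(F)) = \sum_{i \in I} \beta_i \mu_i(E \cap T^{-1}(F)) = \sum_{i \in I} \beta_i \int_F P_{\mu_i}(E)\, d\mu_i T^{-1} = \int_F \sum_{i \in I} \beta_i P_{\mu_i}(E) \frac{d\mu_i T^{-1}}{d\mu_I T^{-1}}\, d\mu_I T^{-1}.$$

Let $I_1, \ldots, I_r$ be the equivalence classes in $I$ modulo the relation $i \equiv k$ if and only if $P_{\mu_i} = P_{\mu_k}$; that is, if and only if $T$ is sufficient for the pair $\{\mu_i, \mu_k\}$. Then we have

$$P_{\mu_I}(E) = \sum_{i \in I} \beta_i \frac{d\mu_i T^{-1}}{d\mu_I T^{-1}} P_{\mu_i}(E) = \sum_{\ell=1}^{r} \Big( \sum_{i \in I_\ell} \beta_i \frac{d\mu_i T^{-1}}{d\mu_I T^{-1}} \Big) P_\ell(E),$$

where $P_\ell(E)$ is the common value of the $P_{\mu_i}(E)$ for $i \in I_\ell$. By Lemma 4, $P_\ell$ is also the conditional probability function of the mixture of the $\mu_i$, $i \in I_\ell$. Thus,

$$P_{\mu_I} = \sum_{\ell=1}^{r} \frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} P_{\mu_{I_\ell}},$$

where $\mu_{I_\ell}$ is the mixture

$$\mu_{I_\ell} = \sum_{i \in I_\ell} \beta_i \mu_i.$$

Whenever the conditional probability function of a mixture $\mu_I$ is written in this fashion with $I_1, \ldots, I_r$ being equivalence classes modulo the relation $\equiv$, we will say that $P_{\mu_I}$ is written in normal form.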

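Definition 5: The set $C(\mathcal{M})$ is conditionally identifiable with respect to the statistic $T$ if for each pair $\{\mu_I, \mu_J\}$ in $C(\mathcal{M})$, whenever $P_{\mu_I} = P_{\mu_J}$ and $P_{\mu_I}$, $P_{\mu_J}$ are expressed in normal form

$$P_{\mu_I} = \sum_{\ell=1}^{r} \frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} P_{\mu_{I_\ell}}, \qquad P_{\mu_J} = \sum_{k=1}^{s} \frac{d\mu_{J_k} T^{-1}}{d\mu_J T^{-1}} P_{\mu_{J_k}},$$

then $r = s$ and for each $\ell = 1, \ldots, r$ there exists exactly one $k = 1, \ldots, r$ such that

$$\frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} = \frac{d\mu_{J_k} T^{-1}}{d\mu_J T^{-1}} \quad \text{and} \quad P_{\mu_{I_\ell}} = P_{\mu_{J_k}}.$$

The set $C(\mathcal{M})$ is marginally identifiable with respect to $T$ if the set $\{\mu T^{-1} \mid \mu \in \mathcal{M}\}$ is linearly independent over the real numbers.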

Theorem 6: If $C(\mathcal{M})$ is both marginally identifiable and conditionally identifiable with respect to a statistic $T$, then $C(\mathcal{M})$ is identifiable.

Proof: Suppose $\mu_I = \sum_{i \in I} \beta_i \mu_i = \sum_{j \in J} \beta_j \mu_j = \mu_J$, where the measures in each sum are distinct members of $\mathcal{M}$. Then, expressed in normal form,

$$P_{\mu_I} = \sum_{\ell=1}^{r} \frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} P_{\mu_{I_\ell}} = \sum_{\ell=1}^{r} \frac{d\mu_{J_\ell} T^{-1}}{d\mu_J T^{-1}} P_{\mu_{J_\ell}} = P_{\mu_J},$$

and we may assume without loss of generality that

$$\frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} = \frac{d\mu_{J_\ell} T^{-1}}{d\mu_J T^{-1}}$$

and $P_{\mu_{I_\ell}} = P_{\mu_{J_\ell}}$ for $\ell = 1, \ldots, r$.

Since $\mu_I T^{-1} = \mu_J T^{-1}$, it follows that $\mu_{I_\ell} T^{-1} = \mu_{J_\ell} T^{-1}$. For distinct $i, k \in I_\ell$, $\mu_i T^{-1} \ne \mu_k T^{-1}$, for otherwise, since $P_{\mu_i} = P_{\mu_k}$, we would have $\mu_i = \mu_k$, contradicting the assumption that $\{\mu_i : i \in I\}$ are distinct. Similarly, the $\mu_j T^{-1}$ for $j \in J_\ell$ are all distinct. Since $C(\mathcal{M})$ is marginally identifiable, $I_\ell$ and $J_\ell$ have the same number of elements and for each $i \in I_\ell$ there is a unique $j(i) \in J_\ell$ such that $\mu_i T^{-1} = \mu_{j(i)} T^{-1}$ and $\beta_i = \beta_{j(i)}$. Since $P_{\mu_i} = P_{\mu_{j(i)}}$, it follows that $\mu_i = \mu_{j(i)}$ for each $i \in I_\ell$. Therefore, there is a one to one map $j$ from $I$ onto $J$ such that $\mu_{j(i)} = \mu_i$ for each $i \in I$. Hence, $C(\mathcal{M})$ is identifiable, and the proof is complete.

For conditionally identifiable sets of measures, the following theorem 
and its corollary provide some characterizations of sufficient statistics. 

Theorem 7: If $\mathcal{M}$ is homogeneous, $C(\mathcal{M})$ is conditionally identifiable with respect to a statistic $T$, and $\mu_I, \mu_J$ are in $C(\mathcal{M})$, then $T$ is sufficient for the pair $\{\mu_I, \mu_J\}$ if and only if there exist partitions $I = I_1 \cup \ldots \cup I_r$ and $J = J_1 \cup \ldots \cup J_r$ such that for each $\ell = 1, \ldots, r$

(a) $$\frac{d\mu_{I_\ell}}{d\mu_{J_\ell}} = \frac{d\big(\sum_{i \in I_\ell} \beta_i \mu_i\big)}{d\big(\sum_{j \in J_\ell} \beta_j \mu_j\big)} = \frac{d\mu_I}{d\mu_J}$$

and

(b) $T$ is sufficient for the set $\mathcal{N}_\ell = \{\mu_k : k \in I_\ell \cup J_\ell\}$.

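Proof: First suppose such partitions exist. By (b) $T$ is sufficient for the set $\mathcal{N}_\ell$ and hence, by Lemma 4, it is sufficient for the pair $\{\mu_{I_\ell}, \mu_{J_\ell}\}$. It follows from (a) and Theorem 3 that $T$ is sufficient for the pair $\{\mu_I, \mu_J\}$.

Suppose that $T$ is sufficient for the pair $\{\mu_I, \mu_J\}$. Then, expressed in normal form,

$$P_{\mu_I} = \sum_{\ell=1}^{r} \frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} P_{\mu_{I_\ell}} = \sum_{\ell=1}^{r} \frac{d\mu_{J_\ell} T^{-1}}{d\mu_J T^{-1}} P_{\mu_{J_\ell}} = P_{\mu_J},$$

and we may assume without loss of generality that

$$\frac{d\mu_{I_\ell} T^{-1}}{d\mu_I T^{-1}} = \frac{d\mu_{J_\ell} T^{-1}}{d\mu_J T^{-1}} \quad \text{and} \quad P_{\mu_{I_\ell}} = P_{\mu_{J_\ell}}$$

for each $\ell$. The condition $P_{\mu_{I_\ell}} = P_{\mu_{J_\ell}}$ is equivalent to (b). By Theorem 3, there exists a representative $f \in \frac{d\mu_I}{d\mu_J}$ which is $T^{-1}(\mathcal{B})$ measurable. If $g \in \frac{d\mu_I T^{-1}}{d\mu_J T^{-1}}$, then $g \circ T$ is $T^{-1}(\mathcal{B})$ measurable and for each $F \in \mathcal{B}$,

$$\int_{T^{-1}(F)} g \circ T\, d\mu_J = \int_F g\, d\mu_J T^{-1} = \mu_I T^{-1}(F) = \int_{T^{-1}(F)} f\, d\mu_J.$$

It follows that $g \circ T = f$ a.e. $(\mu_J)$. Thus,

$$\frac{d\mu_I T^{-1}}{d\mu_J T^{-1}} \circ T = \Big\{ g \circ T \;\Big|\; g \in \frac{d\mu_I T^{-1}}{d\mu_J T^{-1}} \Big\} \subset \frac{d\mu_I}{d\mu_J}.$$

Since $T$ is also sufficient for the pair $\{\mu_{I_\ell}, \mu_{J_\ell}\}$, a similar argument gives

$$\frac{d\mu_{I_\ell} T^{-1}}{d\mu_{J_\ell} T^{-1}} \circ T \subset \frac{d\mu_{I_\ell}}{d\mu_{J_\ell}}$$

for each $\ell$. Since

$$\frac{d\mu_{I_\ell} T^{-1}}{d\mu_{J_\ell} T^{-1}} = \frac{d\mu_I T^{-1}}{d\mu_J T^{-1}}$$

for each $\ell$, it follows that (a) holds for each $\ell$ and the proof is complete.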

Corollary 8: If $\mathcal{M}$ is homogeneous and $C(\mathcal{M})$ is conditionally identifiable with respect to a statistic $T$, then $T$ is sufficient for a pair $\{\mu_I, \mu_J\}$ in $C(\mathcal{M})$ if and only if there exist subsets $I_1 \subset I$ and $J_1 \subset J$ such that

(a) $$\frac{d\mu_{I_1}}{d\mu_{J_1}} = \frac{d\mu_I}{d\mu_J}$$

and

(b) $T$ is sufficient for $\mathcal{N} = \{\mu_k : k \in I_1 \cup J_1\}$.

Proof: That $T$ sufficient implies the existence of $I_1$ and $J_1$ satisfying (a) and (b) is immediate from Theorem 7. Conversely, if $I_1$ and $J_1$ satisfy (a) and (b), then $T$ is sufficient for $\{\mu_{I_1}, \mu_{J_1}\}$ by (b) and hence, by (a), $T$ is sufficient for $\{\mu_I, \mu_J\}$.

Given a pair of mixtures $\mu_I, \mu_J$ in $C(\mathcal{M})$ we will call their likelihood ratio $\frac{d\mu_I}{d\mu_J}$ indecomposable if $I_1 \subset I$, $J_1 \subset J$ and

$$\frac{d\mu_{I_1}}{d\mu_{J_1}} = \frac{d\mu_I}{d\mu_J}$$

imply $I_1 = I$ and $J_1 = J$. It is clear from Theorem 7 that if $C(\mathcal{M})$ is conditionally identifiable with respect to $T$ and a pair of mixtures $\mu_I, \mu_J$ in $C(\mathcal{M})$ have an indecomposable likelihood ratio, then $T$ is sufficient for $\{\mu_I, \mu_J\}$ if and only if it is sufficient for $\{\mu_k : k \in I \cup J\}$. Also, it is not difficult to see that for each pair $\mu_I, \mu_J$ in $C(\mathcal{M})$ there exist nonempty subsets $I_1 \subset I$ and $J_1 \subset J$ such that

$$\frac{d\mu_{I_1}}{d\mu_{J_1}} = \frac{d\mu_I}{d\mu_J}$$

and the likelihood ratio $\frac{d\mu_{I_1}}{d\mu_{J_1}}$ is indecomposable. If $\mu_I$ and $\mu_J$ represent the probability laws for two alternative hypotheses, then there would be two advantages in being able to identify subsets $I_1$ and $J_1$ satisfying these two criteria. First, the maximum likelihood decision procedure would be simplified, and second, the search for a statistic sufficient for deciding between the two hypotheses and having the property that $C(\mathcal{M})$ is conditionally identifiable could be restricted to those statistics sufficient for $\{\mu_k : k \in I_1 \cup J_1\}$.


3. Sufficient Linear Statistics for Mixtures of Normals:

If $\mathcal{K}$ is a subring of the ring $\mathcal{J}$ introduced in Section 2, then with the usual definition of addition and multiplication by elements of $\mathcal{K}$, the set of all functions $\varphi : \mathcal{A} \to \mathcal{J}$ is a module over $\mathcal{K}$. Thus, it is natural to consider $\mathcal{K}$-independence of a set $\mathcal{S}$ of such functions. To be precise, $\mathcal{S}$ is $\mathcal{K}$-independent if whenever $\varphi_1, \ldots, \varphi_m$ is a finite set of distinct elements of $\mathcal{S}$ and $\gamma_1, \ldots, \gamma_m$ are elements of $\mathcal{K}$ such that

$$\gamma_1 \varphi_1(E) + \ldots + \gamma_m \varphi_m(E) = 0 \quad \text{for each } E \in \mathcal{A},$$

then $\gamma_1 = \ldots = \gamma_m = 0$. If $\mathcal{K}$ is a subring of $\mathcal{J}$ which contains all the Radon-Nikodym derivatives $\frac{d\mu T^{-1}}{d\nu T^{-1}}$ for $\mu, \nu \in C(\mathcal{M})$, then it is clear that $\mathcal{K}$-independence of the set $\{P_\nu : \nu \in \mathcal{M}\}$ implies that $C(\mathcal{M})$ is conditionally identifiable with respect to $T$.

For the remainder of this section we will assume that $X$ is $\mathbb{R}^n$, $Y$ is $\mathbb{R}^k$ $(k \le n)$ and $T : X \to Y$ is linear and full rank. $\mathcal{A}$ and $\mathcal{B}$ are respectively the Borel fields on $\mathbb{R}^n$ and $\mathbb{R}^k$. We also assume that each $\mu \in \mathcal{M}$ is described by a normal density function $f_\mu$ with mean $m_\mu$ and covariance $\Omega_\mu$. That is, for each $E \in \mathcal{A}$,

$$\mu(E) = \int_E f_\mu\, d\lambda_n,$$

where $\lambda_n$ is Lebesgue measure on $\mathbb{R}^n$.

By a suitable choice of the coordinate system, we may represent the densities $f_\mu$ as joint density functions $f_\mu(y, z)$ on $\mathbb{R}^k \times \mathbb{R}^{n-k}$ while representing $T$ as the projection $T(y, z) = y$. Then the marginal densities

$$g_\mu(y) = \int_{\mathbb{R}^{n-k}} f_\mu(y, z)\, dz$$

are normal with means $T m_\mu$ and covariance matrices $T \Omega_\mu T^T$ (Anderson, 1958).


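The conditional density functions

$$h_\mu(z \mid y) = \frac{f_\mu(y, z)}{g_\mu(y)}$$

are normal as functions of $z \in \mathbb{R}^{n-k}$ with means

$$S m_\mu + S \Omega_\mu T^T (T \Omega_\mu T^T)^{-1} (y - T m_\mu) \tag{1}$$

and covariances

$$S \Omega_\mu S^T - S \Omega_\mu T^T (T \Omega_\mu T^T)^{-1} T \Omega_\mu S^T, \tag{2}$$

where $S$ is the linear operator $S(y, z) = z$. The conditional probabilities $P_\mu(E)$ are represented by

$$P_\mu(E \mid y) = \int_{S_y(E)} h_\mu(z \mid y)\, dz,$$

where $S_y(E) = \{z \in \mathbb{R}^{n-k} \mid (y, z) \in E\}$.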
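Formulas (1) and (2) can be sanity-checked numerically in the simplest case $n = 2$, $k = 1$, where $T(y, z) = y$, $S(y, z) = z$, and every matrix product reduces to a scalar. The sketch below compares the closed forms against moments of $f(y, z)/g(y)$ computed by a midpoint sum; the parameter values, the integration range, and the function names are assumptions made for illustration only.

```python
import math

def joint_density(y, z, m, omega):
    """Bivariate normal density with mean m = (m_y, m_z) and covariance
    omega = ((w_yy, w_yz), (w_yz, w_zz))."""
    (wyy, wyz), (_, wzz) = omega
    det = wyy * wzz - wyz * wyz
    dy, dz = y - m[0], z - m[1]
    # Quadratic form (x - m)^T omega^{-1} (x - m) for a 2 x 2 covariance.
    quad = (wzz * dy * dy - 2.0 * wyz * dy * dz + wyy * dz * dz) / det
    return math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))

def conditional_moments_formula(y, m, omega):
    """Scalar versions of (1) and (2): mean and variance of z given y."""
    (wyy, wyz), (_, wzz) = omega
    mean = m[1] + wyz / wyy * (y - m[0])
    var = wzz - wyz * wyz / wyy
    return mean, var

def conditional_moments_numeric(y, m, omega, lo=-30.0, hi=30.0, steps=20000):
    """Mean and variance of h(z | y) = f(y, z) / g(y) by a midpoint sum in z."""
    dz = (hi - lo) / steps
    zs = [lo + (i + 0.5) * dz for i in range(steps)]
    ws = [joint_density(y, z, m, omega) for z in zs]
    g = sum(ws) * dz  # marginal density g(y)
    mean = sum(z * w for z, w in zip(zs, ws)) * dz / g
    var = sum((z - mean) ** 2 * w for z, w in zip(zs, ws)) * dz / g
    return mean, var

# Hypothetical parameters.
m = (1.0, -2.0)
omega = ((2.0, 0.6), (0.6, 1.0))
formula = conditional_moments_formula(0.5, m, omega)
numeric = conditional_moments_numeric(0.5, m, omega)
```

The two results agree to well within the accuracy of the quadrature, which is a check on the formulas in this special case rather than a proof of the general statement.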


Theorem 9: If $\mathcal{M}$ is a family of Borel measures on $\mathbb{R}^n$ given by $n$-variate normal density functions and $T : \mathbb{R}^n \to \mathbb{R}^k$ is linear of rank $k$, then $C(\mathcal{M})$ is conditionally identifiable with respect to $T$.


Proof: It can readily be verified that conditional identifiability of $C(\mathcal{M})$ is not affected by the change of variables just described. If $\mu_I$ and $\mu_J$ are in $C(\mathcal{M})$, then the Radon-Nikodym derivative $\frac{d\mu_I T^{-1}}{d\mu_J T^{-1}}$ is represented by a function of the form

$$\frac{\sum_{i \in I} \beta_i g_{\mu_i}(y)}{\sum_{j \in J} \beta_j g_{\mu_j}(y)},$$

i.e., a ratio of mixtures of $k$-variate normal density functions, which is continuous. Hence, by the remarks in the first paragraph of this section, it suffices to show that the set $\{P_\mu : \mu \in \mathcal{M}\}$ of conditional probability functions is $\mathcal{K}$-independent, where $\mathcal{K}$ is the subring of $\mathcal{J}$ consisting of those elements of $\mathcal{J}$ which have a continuous representative. To this end, let $P_{\mu_1}, \ldots, P_{\mu_r}$ be distinct and let $\gamma_1, \ldots, \gamma_r$ be continuous real valued functions on $\mathbb{R}^k$ such that for each $E \in \mathcal{A}$,

$$\gamma_1(y) P_{\mu_1}(E \mid y) + \ldots + \gamma_r(y) P_{\mu_r}(E \mid y) = 0$$

for almost all $y$. In particular, choosing for $E$ sets of the form $\mathbb{R}^k \times K$, where $K$ is a Borel set in $\mathbb{R}^{n-k}$, we have

$$\gamma_1(y) \int_K h_{\mu_1}(z \mid y)\, dz + \ldots + \gamma_r(y) \int_K h_{\mu_r}(z \mid y)\, dz = 0$$

for almost all $y$. For each $K$, $\int_K h_{\mu_i}(z \mid y)\, dz$ is a continuous function of $y$. Hence,

$$\int_K \big( \gamma_1(y) h_{\mu_1}(z \mid y) + \ldots + \gamma_r(y) h_{\mu_r}(z \mid y) \big)\, dz = 0$$

for each $y \in \mathbb{R}^k$. It follows that

$$\gamma_1(y) h_{\mu_1}(z \mid y) + \ldots + \gamma_r(y) h_{\mu_r}(z \mid y) = 0$$

for each $y \in \mathbb{R}^k$, $z \in \mathbb{R}^{n-k}$. Let $F$ be the set of $y \in \mathbb{R}^k$ where two or more of the conditional density functions $h_{\mu_i}(z \mid y)$ are equal as functions of $z$. It is easily seen from (1) and (2) that the Lebesgue measure of $F$ is zero. For $y \notin F$, $\{h_{\mu_1}(\cdot \mid y), \ldots, h_{\mu_r}(\cdot \mid y)\}$ is a set of distinct normal density functions of $z$. Hence (Yakowitz and Spragins; 1968), they are linearly independent over the real numbers. Therefore, for $y \notin F$, $\gamma_1(y) = \ldots = \gamma_r(y) = 0$. That is, $\gamma_1 = \ldots = \gamma_r = 0$ as elements of $\mathcal{K}$. Thus, $C(\mathcal{M})$ is conditionally identifiable.

If $\mu_I$ is in $C(\mathcal{M})$, then $\mu_I$ has a density function

$$f_{\mu_I} = \sum_{i \in I} \beta_i f_{\mu_i},$$

which is a mixture of normal density functions. The following theorem is an immediate consequence of Theorems 7 and 9.

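Theorem 10: Given the assumptions of Theorem 9, the statistic $T$ is sufficient for a pair $\{\mu_I, \mu_J\}$ in $C(\mathcal{M})$ if and only if there exist partitions $I = I_1 \cup \ldots \cup I_r$ and $J = J_1 \cup \ldots \cup J_r$ such that for each $\ell = 1, \ldots, r$,

(a) $$\frac{\sum_{i \in I_\ell} \beta_i f_{\mu_i}(x)}{\sum_{j \in J_\ell} \beta_j f_{\mu_j}(x)} = \frac{\sum_{i \in I} \beta_i f_{\mu_i}(x)}{\sum_{j \in J} \beta_j f_{\mu_j}(x)}, \quad x \in \mathbb{R}^n,$$

and

(b) $T$ is sufficient for the family $\{f_{\mu_k} : k \in I_\ell \cup J_\ell\}$ of normal density functions.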


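There is a set of purely algebraic conditions which are equivalent to (b); namely, that the expressions

$$\Omega_{\mu_k} - \Omega_{\mu_k} T^T (T \Omega_{\mu_k} T^T)^{-1} T \Omega_{\mu_k},$$

$$m_{\mu_k} - \Omega_{\mu_k} T^T (T \Omega_{\mu_k} T^T)^{-1} T m_{\mu_k},$$

$$\Omega_{\mu_k} T^T (T \Omega_{\mu_k} T^T)^{-1}$$

are all independent of $k \in I_\ell \cup J_\ell$ (Peters, Redner, and Decell; 1976).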
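The algebraic conditions lend themselves to a direct mechanical check. The sketch below evaluates the three expressions for each normal law in a finite family and compares them exactly using rational arithmetic. It assumes the expressions in the form $\Omega - \Omega T^T (T \Omega T^T)^{-1} T \Omega$, $m - \Omega T^T (T \Omega T^T)^{-1} T m$ and $\Omega T^T (T \Omega T^T)^{-1}$; the helper names and the example family are hypothetical.

```python
from fractions import Fraction

def mat(rows):
    """Convert a list of rows to a matrix of exact rationals."""
    return [[Fraction(v) for v in row] for row in rows]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(col) for col in zip(*a)]

def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def inverse(a):
    """Exact Gauss-Jordan inverse; assumes the square matrix is nonsingular."""
    n = len(a)
    aug = [list(row) + [Fraction(int(i == j)) for j in range(n)]
           for i, row in enumerate(a)]
    for c in range(n):
        p = next(i for i in range(c, n) if aug[i][c] != 0)
        aug[c], aug[p] = aug[p], aug[c]
        piv = aug[c][c]
        aug[c] = [v / piv for v in aug[c]]
        for i in range(n):
            if i != c and aug[i][c] != 0:
                f = aug[i][c]
                aug[i] = [v - f * w for v, w in zip(aug[i], aug[c])]
    return [row[n:] for row in aug]

def invariants(T, m, omega):
    """The three expressions for one law with mean m (a column) and covariance omega."""
    T, m, omega = mat(T), mat(m), mat(omega)
    Tt = transpose(T)
    # gain = Omega T^T (T Omega T^T)^{-1}
    gain = matmul(matmul(omega, Tt), inverse(matmul(matmul(T, omega), Tt)))
    proj = matmul(gain, T)
    return (sub(omega, matmul(proj, omega)), sub(m, matmul(proj, m)), gain)

def satisfies_algebraic_conditions(T, laws):
    """True when the three expressions agree across all (mean, covariance) pairs."""
    first = invariants(T, *laws[0])
    return all(invariants(T, m, om) == first for m, om in laws[1:])

# Hypothetical example with n = 2, k = 1 and T(y, z) = y.
T = [[1, 0]]
base = ([[0], [0]], [[1, 0], [0, 1]])
shifted_in_y = ([[5], [0]], [[4, 0], [0, 1]])   # differs only in the y-marginal
shifted_in_z = ([[0], [1]], [[1, 0], [0, 1]])   # differs in the z-direction
```

With these inputs the conditions hold for `[base, shifted_in_y]` and fail for `[base, shifted_in_z]`, which matches the intuition that the projection onto $y$ loses the information separating laws that differ only in $z$.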